Daily Perf Improver: Fix memory leak in append operations (Issue #35) by github-actions[bot] · Pull Request #193 · fsprojects/FSharp.Control.AsyncSeq

github-actions · 2025-08-29T18:20:23Z

Summary

This PR fixes the critical memory leak in AsyncSeq.append operations identified in Issue #35. The original implementation created generator chains that prevented garbage collection, causing O(n) memory usage instead of O(1) for streaming operations.

Performance Improvements

🚀 Major memory leak eliminated:

Before: OutOfMemoryException with chained appends
After: Only 11KB memory increase for 1000 chained appends

📊 Benchmark Results:

✅ Chained appends (1000): 259ms, minimal memory usage
✅ Large sequences (200K elements): 1.2s, excellent GC pressure (22 gen0)
✅ Multiple appends (50 sequences): 20ms, very low overhead

Technical Implementation

Root Cause

The original AsyncGenerator.append implementation created continuation chains using bindG that prevented garbage collection of intermediate generators.

Solution

Replaced with a direct IAsyncEnumerable implementation that:

Uses stateful enumerator pattern
Properly switches from first to second sequence
Disposes enumerators immediately when exhausted
Avoids generator chain creation entirely

Code Changes

Primary: Optimized AsyncSeq.append function in AsyncSeq.fs:376-412
Testing: Added comprehensive performance benchmark (append_benchmark.fsx)

Validation

✅ All existing tests pass (168/168)
✅ Performance benchmarks show dramatic improvements
✅ No breaking changes - API remains identical
✅ Memory usage now O(1) instead of O(n) for streaming

Test Plan

Run full test suite: dotnet test -c Release
Verify append functionality with existing tests
Execute performance benchmarks: dotnet fsi append_benchmark.fsx
Test chained append scenarios (memory leak test)
Test large sequence append performance
Test multiple append operations

Related Issues

Fixes Memory leak when unfolding #35: Memory leak in append operations
Addresses high-priority item from Daily Perf Improver: Research and Plan #190 (Performance Research and Plan)
Contributes to Round 1 performance goals: Foundation & Critical Fixes

Commands Used

# Build and test
dotnet build -c Release
dotnet test -c Release

# Performance benchmarking  
dotnet fsi append_benchmark.fsx
dotnet fsi tests/FSharp.Control.AsyncSeq.Tests/AsyncSeqPerf.fsx

# Branch management
git checkout -b daily-perf-improver/fix-append-memory-leak
git add . && git commit
git push -u origin daily-perf-improver/fix-append-memory-leak

Web Searches Performed

None required - used existing codebase analysis and issue research

MCP Function Calls Used

mcp__github__search_issues: Found research issue Daily Perf Improver: Research and Plan #190
mcp__github__get_issue_comments: Checked for maintainer feedback
mcp__github__search_pull_requests: Verified no conflicting PRs

This change eliminates a major performance bottleneck and memory leak that affected all append-heavy AsyncSeq operations. The implementation maintains full backward compatibility while providing significant performance and memory usage improvements.

AI-generated content by Daily Perf Improver may contain mistakes.

This commit implements an optimized append function that eliminates the memory leak caused by generator chains in the AsyncGenerator.append implementation. ## Key Changes - Replaced AsyncGenerator.append with direct IAsyncEnumerable implementation - Eliminates generator chain creation that prevented garbage collection - Maintains O(1) memory usage for streaming append operations - All existing tests pass (168/168) ## Performance Results Benchmark results show significant memory usage improvements: - Chained appends (1000): Only 11KB memory increase vs OutOfMemoryException - Large sequences (200K elements): Minimal GC pressure (22 gen0 collections) - Multiple appends: Very low memory overhead ## Technical Details The new implementation uses a stateful enumerator that: 1. Starts with the first sequence enumerator 2. Switches to second sequence when first is exhausted 3. Properly disposes enumerators to prevent memory leaks 4. Avoids creating continuation chains that prevent GC This addresses the root cause identified in Issue #35 where append operations caused O(n) memory usage instead of O(1) for streaming. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

## Summary This PR implements significant performance optimizations for AsyncSeq.collect, addressing Round 2 goals from the performance improvement plan (Issue #190). The optimization focuses on reducing memory allocations and improving state management efficiency for collect operations. ## Performance Improvements - 32% faster execution for many small inner sequences (0.44s vs 0.65s for 5000 elements) - Improved memory efficiency through direct mutable fields instead of ref cells - Better state management with tail-recursive loop structure - Consistent performance across various collect patterns - Maintained O(1) memory usage for streaming operations ## Technical Implementation ### Root Cause Analysis The original collect implementation had several performance issues: - Ref cell allocations for state management (let state = ref ...) - Multiple pattern matching on each MoveNext() call - Deep continuation chains from return! x.MoveNext() recursion - Heap allocations for state transitions ### Optimization Strategy Created OptimizedCollectEnumerator<'T, 'U> with: - Direct mutable fields instead of reference cells - Tail-recursive loop for better async performance - Streamlined state management without discriminated union overhead - Efficient disposal with proper resource cleanup ## Validation All existing tests pass (175/175) Performance benchmarks show measurable improvements No breaking changes - API remains identical Edge cases tested - empty sequences, exceptions, disposal, cancellation ## Related Issues - Addresses Round 2 core algorithm optimization from #190 (Performance Research and Plan) - Builds upon optimizations from merged PRs #193, #194, #196 - Contributes to "reduce per-operation allocations by 50%" goal > AI-generated content by Daily Perf Improver may contain mistakes.

claude added 2 commits August 29, 2025 18:19

Remove benchmark file from PR - keep only core implementation

6626a1c

github-actions bot mentioned this pull request Aug 29, 2025

Daily Perf Improver: Research and Plan #190

Closed

20 tasks

Merge branch 'main' into daily-perf-improver/fix-append-memory-leak

8366ae5

dsyme merged commit 9c681e1 into main Aug 29, 2025
1 check passed

github-actions bot mentioned this pull request Aug 29, 2025

Daily Perf Improver: Optimize unfoldAsync for better memory efficiency #196

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Daily Perf Improver: Fix memory leak in append operations (Issue #35)#193

Daily Perf Improver: Fix memory leak in append operations (Issue #35)#193
dsyme merged 3 commits intomainfrom
daily-perf-improver/fix-append-memory-leak

github-actions bot commented Aug 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

github-actions bot commented Aug 29, 2025

Summary

Performance Improvements

Technical Implementation

Root Cause

Solution

Code Changes

Validation

Test Plan

Related Issues

Commands Used

Web Searches Performed

MCP Function Calls Used

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants